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Abstract. Waring's classical problem deals with expressing every nat- 
ural number as a sum of g(k) fcth powers. Recently there has been 
considerable interest in similar questions for nonabelian groups, and 
simple groups in particular. Here the fcth power word is replaced by an 
arbitrary group word w 1, and the goal is to express group elements 
as short products of values of w. 

We give a best possible and somewhat surprising solution for this 
Waring type problem for various finite simple groups, showing that a 
product of length two suffices to express all elements. We also show that 
the set of values of w is very large, improving various results obtained 
so far. 

Along the way we also obtain new results of independent interest on 
character values and class squares in symmetric groups. 

Our methods involve algebraic geometry, representation theory, prob- 
abilistic arguments, as well as three prime theorems from additive num- 
ber theory (approximating Goldbach's Conjecture). 
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1. Introduction 

Waring's problem asks whether every natural number is a sum of g(k) 
kth. powers (where g is a suitable function). This was solved affirmatively 
by Hilbert in 1909. Optimizing g(k) has been a central problem in additive 
number theory ever since (see |Na| for more detail and background). 

Recently there has been considerable interest in group theoretic analogues 
of this phenomenon, where the aim is to present group elements as short 
products of certain "special" elements. These special elements can be pow- 
ers, or commutators, or values of a general word w, or elements of a special 
conjugacy class in the group. 

The motivation for this is sometimes topological. Let G be a (topologi- 
cally) finitely generated pro-p group. Using expressions of elements of the 
commutator subgroup G as bounded products of commutators Serre showed 
that G' is closed, and deduced that every finite index subgroup of G is open. 
A recent deep result of Nikolov and Segal |NSlj INS2j shows that this holds 
for every finitely generated profinite group. Again the core of the proof is 
presenting group elements as short products of values of certain words. 

In the realm of finite simple groups such Waring type problems have been 
studied in even greater detail. We need some notation. 

Let w = w{x\, . . . , Xd) be a non-trivial group word, namely a non-identity 
element of the free group F d on x±, . . . , x d . Then we may write w = 
x^x^ ■ ■ ■ x™* where ij £ {1, . . . , d}, rij are integers, and we may assume 
further that w is reduced. Let G be a group. For gx, . . . ,g d E G we write 

w(gi,...,g d ) = g^g* 2 ■ ■ ■ g% G G. 

Let 

w(G) = {w(gi,. .. ,g d ) : gi,... ,g d e G} 

be the set of values of w in G. For subsets A, B C G let AB = {ab\a € 
A,b £ B} and A k = {m ■ ■ ■ a k \ai G G}. 

Fix a non-trivial group word w and let G be a finite simple group. If G 
is large enough then it follows from Jones [J] that w(G) ^ {1} (namely w 
is not an identity in G). Can we then find a constant c (which may depend 
on w but not on G) such that w(G) c = Gl 

Various instances of this problem have been solved affirmatively in the 
past decade or two. See Wilson [W] for the commutator word w{x\,X2) = 
[xi,rr2] = x^ 1 X2 1 xiX2, and Martinez and Zelmanov |MZ] and Saxl and 
Wilson [SW] for power words w = x\. It follows from their result that 
every element of a large enough finite simple group is a product of f(k) kth 
powers. 

In [LiSh] this is generalized to arbitrary words w. It is proved there that 
if w(G) / {1} then w(G) c = G, where the exponent c = c(w) depends on 
w (but is not given explicitly). A somewhat surprising new result from |Sh] 
is that the exponent c can be chosen independent of the word w and may 
be, in fact, a very small number. More specifically for every word w ^ 1 
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there is a positive integer N = N(w) such that for every finite simple group 
G with \G\ > N(w) we have 

w(Gf = G. 

The main purpose of this paper is to prove an even stronger result for 
certain families of finite simple groups, where the exponent 3 is replaced by 
2 (see Theorems 11.51 and 1 1 . 6 1 b elow) . Along the way we prove several results 
of independent interest, related to the size of w(G), and to powers of certain 
conjugacy classes C in G. 

Given a conjugacy class C / 1 in a finite simple group G there exists 
a number k such that C k = G. Substantial work has been devoted to the 
study of these numbers k, and related so-called covering numbers (see e.g. 

[AH EghJ mi Est])- 

A particular challenge is to show that C 2 = G in certain cases. Indeed a 
conjecture of Thompson states that every finite simple group G has a class 
C with this property. In spite of considerable progress this is still open in 
general. 

Now let C = a Sn be a conjugacy class in S n . When can we say that 
C 2 = A n l This problem has quite a long history. Gleason |Hu] seems to 
have been the first to observe that C 2 = A n if a is an n-cycle. See also 
Bertram [Be]. Later this was generalized by Brenner [Br] to the case where 
a consists of two cycles (or more generally two non-trivial cycles with some 
additional fixed points). The case of permutations with more general cycle 
structure remained wide open. 

Our first result deals with permutations with any given number of cycles, 
provided n is sufficiently large. We denote the number of cycles (including 
1-cycles) of a permutation a by cyc(a). 

Theorem 1.1. For every positive integer k there is a number f(k) such that 
ifn> f(k) and a is a permutation in S n with cyc(cr) < k then (a Sn ) 2 = A n . 

Moreover, there is an absolute constant c such that, if n > c and a is a 
permutation in S n with cyc(cr) < n 1 / 128 , then (a Sn ) 2 = A n . 

By the Erdos-Turan theory (see |ET] ) . a randomly chosen permutation 
in S n has O(logn) cycles. This gives rise to the following result. 

Corollary 1.2. (i) The probability that a randomly chosen permutation 
a € S n satisfies (a ") 2 = A n tends to 1 as n — > oo. 

(ii) The probability that a randomly chosen permutation a S A n satisfies 
(a An ) 2 = A n tends to 1 as n — > oo. 

Indeed, part (i) follows immediately, and part (ii) follows from (i) using 
the fact that for almost all a G A n we have a A " = a Sn (see [Shj . 3.3). 

Corollary 11.21 sharpens a previous result showing that for almost all a G 
A n we have (a An ) 3 = A n (see Theorem 2.6 in [Shj ) . 

It is intriguing that the proof of Theorem 11.11 is non-elementary in the 
sense that it involves probabilistic arguments and new character-theoretic 
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estimates. In particular we show that all character values x{°~) °f a permu- 
tation a S S n can be bounded in terms of cyc(a) alone (see Theorem 17.2ft . 

In order to apply Theorem 1 1.1 1 to Waring type problems we need to show 
that the image w{A n ) of a word map id / 1 contains conjugacy classes C 
with few cycles. 

Theorem 1.3. For every positive integer n there exists a permutation a n G 
A n with the following properties: 

(i) cyc(cr n ) < 23; 

(ii) For each non-trivial word w there exists N = N{w) such that ifn > N 
then a n G w(A n ). 

We can also arrange that, for n sufficiently large, the permutation a n has 
at least six fixed points. 

Our proof of Theorem 11.31 is highly non-elementary, involving algebraic 
geometry and results from analytic number theory (such as weak versions of 
the Goldbach Conjecture). The idea is to embed groups of the form SL2(F p ) 
and their products in A n , and then focus on the properties of word maps on 
SL 2 (F P ). 

It would be interesting to try to find an elementary proof of Theorem 11.31 
as well as to improve it. In this context we pose the following: 

Problem 1.4. What is the minimal number k such that (for w ^ 1 and 
n > N{w)) w(A n ) always contains permutations with at most k cycles? 
Does w{A n ) necessarily contain a cycle of length n — k for some k which 
may depend on w but not on n? 

Equipped with the tools above, we can now improve earlier Waring type 
results for alternating groups, showing that w{A n ) 2 = A n if n S> 0. In fact 
we prove a bit more. 

Theorem 1.5. For each pair of non-trivial words wi,w 2 there exists N = 
N(wi,w 2 ) such that for all integers n > N we have 

w 1 (A n )w 2 (A n ) = A n . 

The idea of the proof is to combine Theorems 11.11 and 11.31 noting that 
erf" C Wi(A n ) for i = 1, 2 and (cr^") 2 = A n . 

We next establish a similar result for certain finite simple groups of Lie 
type. Indeed, if we limit our attention to groups of Lie type of bounded 
dimension, we can prove the following result: 

Theorem 1.6. Given an integer d and two non-trivial words w\ and w 2 
there exists an integer N = N(d, w±,W2) such that if G is a simply connected 
almost simple algebraic group of dimension d over a finite field ¥ , V = 
G(¥)/Z(G(¥)) is the finite simple group associated to G over¥, and \T\ > N , 
then we have 



w x {Y)w 2 {V)=Y. 
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The method of the proof is to obtain a suitable surjectivity theorem at 
the level of algebraic groups, and then to use the Riemann hypothesis for 
varieties over finite fields to show that the relevant fibers actually have points 
over F. 

As a consequence we obtain the following. 

Corollary 1.7. For each triple of non-trivial words w\, 1112,11)3 there exists 
N = N(wi,W2,W3) such that if G is a finite simple group of order at least 
N , then 

Wl (G)w 2 (G)w 3 (G) = G. 

Indeed, the case of groups of Lie type appears in [Shj . and Theorem 11.51 
above completes the proof by dealing with alternating groups. 
We conjecture this can be strengthened as follows. 

Conjecture 1.8. For each pair of non-trivial words W\,W2 there exists N = 
iV (101,102) such that if G is a finite simple group of order at least N, then 

Wl {G)w 2 {G) = G. 

In view of our results above it remains to prove this for classical groups 
of unbounded rank. This seems quite challenging even in the particular case 
W\ = W2 = x\ . 

Let us now discuss the size of the subset w{G). A related useful result 
of Borcl [Bo] shows that the word map induced by w on simple algebraic 
groups is a dominant map. In [L] this is used to show that if G is a finite 
simple group and e > then 

\w(G)\ > |G| l - 

provided \G\ > f(w,e). In this paper we improve this bound for various 
families of finite simple groups. 

We start with alternating groups. Often the key to estimating the size of 
w(G) is finding special elements in it. 

Indeed, using Theorem 11.31 and the remark following it one can easily 
deduce that, for n > N(w), 

\w(A n )\ > cn- 17 \A n \ 

for some absolute constant c > 0. In fact, we can provide a much better 
estimate: 

Theorem 1.9. For each non-trivial word w and every e > 0, there exists 
N = N(w, e) such that if n > N then 

\w(A n )\ > n~ A - e \A n \. 

We can show that this bound is tight up to the value of the exponent. 
Indeed, consider a power word w(x\) = x\. Then by [L2] we have 

MAO I < \w{S n )\ < an- b n\, 
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where b = 1 — 4>(k)/k and a is some constant (depending on k). Note that, 
choosing k suitably (e.g. as the product of the first m primes) we may 
arrange that b > 1 — e for any fixed e > 0. This shows that the exponent in 
Theorem 11.91 must be at most —1. 

It would be interesting to find out whether Theorem 11.91 can be improved 
as follows. 

Problem 1.10. Is it true that \w(A n )\ > n _1 |A„| provided w ^ 1 and 
n > N{w)? 

While we are unable to settle this for alternating groups, we obtain an 
analogous result for certain simple groups of Lie type. 

Theorem 1.11. Let G be a finite simple group of Lie type and of rank r. 
Let w / 1 be a word. Then if G is not of type A r or 2 A r , we have 

\w{G)\ > cr^\G\ 

for some absolute constant c > 0, provided \G\ > N(w). 

It is interesting that our methods allow us to generalize various results on 
w(G) to intersections of the form n* =1 iOi(G), where w\, . . . , Wk ^ 1 are any 
given words. These generalizations are formulated in the last section of this 
paper. 

Some words on the structure of this paper. Section 2 is devoted to groups 
of Lie type and the proof of Theorem II. Ill In Section 3 we focus on Lie type 
groups of bounded dimension and prove Theorem ll.61 In Section 4 we make a 
closer analysis of groups of the type SL2(F p ). Combined with a certain three 
primes theorem this leads to the proofs of Theorem 11.31 and Theorem 11.91 in 
Section 5. Section 6 contains the elementary part of the proof of Theorem 
11.11 In Section 7 we study character values of S n on permutations with few 
cycles and squares of conjugacy classes of such permutations. In this way 
we obtain a probabilistic proof of Theorem 11.11 Finally, in Section 8 we 
formulate results on the intersection of the images of different word maps, 
and show how to derive them using our methods. 

The first-named author would like to thank Nicholas Katz for some useful 
correspondence. 

2. Word values in groups of Lie type 

In this section, we use algebraic geometry and group-theoretic arguments 
to give lower bounds on |u>(r)| for certain groups T of Lie type, thus proving 
Theorem 11.111 Note that the bounds obtained are actually stronger than 
those we give for alternating groups. Although in the rest of the paper, G 
denotes a finite simple group, in this section and §3, G will always denote a 
simply connected, almost simple algebraic group defined over a finite field. 
To get a finite simple group T, we divide G(¥ q ) by its center. All simple 
groups of Lie type arise in this way, excepting the Suzuki and Ree groups. 
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The Suzuki and Ree groups can also be treated by the methods of this 
section, but we exclude them to avoid unpleasant technicalities. 

Proposition 2.1. Let w be a non-trivial word and G a simply connected 
almost simple algebraic group over a finite field ¥ q . Then there is a positive 
constant c depending only on w and dim G such that 

\w(G(¥ q ))\>c\G(¥ q )\. 

This proposition is proved as [Ll Prop. 7] in a form that also includes the 
case of Ree and Suzuki groups. Here we give an alternative proof that uses 
standard techniques from algebraic geometry instead of results from [LP] . 

We use the following more or less standard result: 

Lemma 2.2. Let ir: X —* y denote a dominant morphism between reduced 
schemes of finite type over Z. Then there exist constants c\ and C2 such that 
for every prime p and every point y £ y(¥ p ), we have 

\{xeX(¥ p ):7r(x) = y}\<c lP di ^- lx 
and i/7r _1 y is non-empty and geometrically irreducible, then 
\{x € X(¥ p ) : vr(x) = y}\ > p dim -^(l - c^' 1 ' 2 ). 

Proof. By the Lefschetz trace formula, these results are immediate conse- 
quences of standard facts about etale cohomology groups: the boundedness 
of the compactly supported cohomology groups of the fibers of tt, the trace 
map on the top-dimensional cohomology of (geometrically irreducible) va- 
rieties, and the weight estimates for the eigenvalues of Frobenius on these 
cohomology groups (see |SGA 4 3 [ XVIII 2.9] and [De, 3.3.1].) □ 

We now prove the proposition. 

Proof. An algebraic group G of this kind is specified by a connected Dynkin 
diagram A, an automorphism of this diagram (which is determined up to 
inner automorphism by its order g < 3), and a finite field ¥ q . It suffices to 
prove this proposition for each fixed choice of A and g. 

First we treat the split case, g = 1. In this case, A determines a simply 
connected Chevalley scheme Q over Spec Z of which G is the F ? -fiber, and 
w determines a morphism of schemes Q d — > Q which we denote tt w . If ir w 
has all fibers of dimension < {d— 1) dimG, then we are done by Lemma l2.2l 
since |G(F g )| > ciq dimG and each fiber has at most C2^ d ~ 1 * > dimG points of 
G(¥ q ) d for some positive constants c\ and C2 independent of q. The difficulty 
is that it might happen, a priori, that most points in G(¥ q ) d lie in fibers of 
tt w of dimension greater than {d — 1) dimG. 

To show that this does not in fact happen for large q we note first that 
ir w is dominant; moreover, it is dominant for every fiber of Q d over Spec Z 



Bo] . Fiber dimension is a constructible function [E GA IV3 9.5.5], so the 
Zariski closure of the set of geometric points s for which 

dim-Tr" 1 ^) > (d — 1) dimG 
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is a proper closed subset of Q. Let X denote its inverse image in Q d . By 
[Bo] . X is a proper closed subvariety of Q d for all p which does not contain 
the generic point of any fiber of Q d over Spec Z. By Lemma 12. 2\ for all 
q S> 0, at least half of the points of G(¥ q ) d do not lie in X(¥ q ), and the 
proposition follows. 

Finally, we treat the case g > 2. Instead of defining a split group scheme 
Q over Spec Z, we define a non-split group scheme it: Q — > S for some 
higher-dimensional base S such that Q splits over a finite etale Galois cover 
S' over S of degree g. For example, we may take 

S = Spec Z[l/g][x 9 , x~ 9 } JJ Spec ¥ g [x 9 - x] 

and 

S' = Spec Z[l/g][x, Spec F 3 [x]. 

The crucial point is that the base is large enough that for every ¥ q there 
exists an ¥ q - valued point s of S whose preimage in S' is ¥ q g, since this 
guarantees that every G/¥ q with the given Dynkin diagram and diagram 
automorphism can be obtained as the fiber of Q for some ¥ q point of S. 

□ 

We can now prove Theorem 1 1.111 

Proof. The case of groups of type E, F, and G and of classical groups of 
any fixed dimension follows from Proposition 12.11 We need therefore con- 
sider only orthogonal groups and symplectic groups. We begin with the 
symplectic case. 

Let V be a 2-dimensional vector space over the field ¥ q r. We endow V 
with an area form A. We define a bilinear form on V, regarded as a vector 
space over ¥ q , as follows: 

(vi,v 2 ) = tr ¥qr/¥q A(vi Av 2 ). 

This form is obviously antisymmetric and non-degenerate. It therefore de- 
termines an inclusion 

%: SL 2 (F g ,) - Sp 2r (F g ). 

If x E SL2(F g r) has eigenvalues X^ 1 , then i{x) has eigenvalues X ±qk , k = 
0, 1, 2, . . . , r — 1. As long as these eigenvalues are pairwise distinct, i(x) is a 
regular semisimple symplectic matrix. This can be achieved by insisting for 
every proper subfield ¥ q d, A belongs neither to ¥* d nor to the norm-1 sub- 
group of ¥ q 2d over ¥ q d . The subset of SL2 (¥ q r ) consisting of elements which 
violate either condition has cardinality 0(q 3r ^ 2 ), so by Proposition 12.11 we 
can find at least ciq 3r elements of w(SL2(¥ q r)) which are regular semisimple 
in Sp 2r (F g ). If semisimple elements a, b € SL2(F g r) have eigenvalues a , 
(j^ 1 respectively, then i(a) and i{b) have the same eigenvalues if and only 
if q = (3 q% or a" 1 = (3 ql for some i G {0, 1, ... ,r — 1}. Thus there are at 
least C2r~ 1 q r distinct conjugacy classes of regular semisimple elements in 
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i(w(SL2(¥ g r))) c w(Sp 2r (Fg)). The centralizer of any regular semisimple 
element i(a) in Sp 2r (Fq) coincides with the image of the centralizer of a in 
SL 2 (F g r); it is therefore of order q r ± 1. Thus, 

\i(w(SL 2 (¥ q r)))\ > C2r -i_«_|Sp 2r (F g )|. 

This proves the theorem for groups of type C. 

For the cases B and D, we begin by observing that given a vector space 

V over ¥ q k and a non-degenerate quadratic form Qq : V — ► ¥ q k defined over 
¥ q k, we can define a non-degenerate quadratic form on V regarded as an 
F 9 -vector space as follows: 

Q(v) = t?¥ qk /¥ q Qo{v). 

This gives an inclusion Spin(V, Qq) < Spin(V, Q). Composing this map with 
an isomorphism SL 2 (F 2k) = Spm^(¥ q k), we obtain an inclusion 

(2.1) SL 2 (¥ q2k ) - Spin± (F ? ) 

for some choice of signs. By the classification of quadratic forms over finite 
fields, every space W of dimension > 4 with a non-degenerate quadratic form 
over ¥ q can be decomposed as W\ _L W% , where W\ is isomorphic to a space 

V obtained as above and dimW^ < 6. We choose such a decomposition, 
and let i denote the map 

SL 2 (F 9 2 fc ) x Spin(W 2 ) -> Spin(VF) 

obtained by combining (|2.ip with the natural map Spin(PFi) x Spin(F4 / 2 ) — » 
Spin(VF). Note that i is at most two-to-one since SO(Wi) x SO(Wz) — > 
SO(VF) is injective. If k > 4, 6 G Spin(M / 2 ) is regular semisimple, and the 
eigenvalues of a semisimple element a £ SL 2 (F g 2fc) are not contained in a 
proper subfield of ¥ q 2k , then the image of i(a, b) in SO(VF) has all eigenvalues 
distinct and is therefore regular semisimple. By Proposition ^. 11 the number 
of such pairs (a, b) with a £ w(SL2(¥ q ik)) is at least 

„, dim W 2 dimtV 

c 3 q ^2 = c 3 q 2 , 
and the number of semisimple conjugacy classes in w(Spin(W)) determined 

1 i —1 dim W 

by such pairs is at least c^r q 2 . For q greater than some constant C5 
(independent of k), we can further specify that b £ w(Spin(W / 2 )) at the cost 
of replacing C4 by cq. The centralizer of i(a,b) in Spin(F4^) is a maximal 
torus of Spin(PF) containing a maximal torus of Spin(Wi); it therefore has 
at most 

, ou ■. , v dim W 2 dim W 

(9 + + 1)^ - < 

elements. We conclude that the total number of elements of Spin(VF) con- 
jugate to elements of i(SL 2 (F g2 fe) x Spin(W 2 )) is at least | £ r~ 1 |Spin(VF)|. 

Finally, we consider the case that q < C5. In this case, we take b = 1, so 
i(a,b) is not regular; its centralizer is Spin(M / 2 ) times a maximal torus of 
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Spin(M^i). The order of the centralizer is therefore less than c^ 111 2 q 2 , 
and the argument goes through as before. 

□ 



3. WARING'S PROBLEM IN BOUNDED DIMENSION 

In this section we focus on simple groups of Lie type G in bounded di- 
mension. Using geometric methods and establishing the irreducibility of 
certain fibers (see Theorem 13.31 below), we obtain a best possible solution 
of Waring's problem for such groups, namely w(G) 2 = G provided |G| is 
large enough. Moreover, we prove a somewhat stronger result, namely The- 
orem [L6] dealing with two different words. 

We begin with a basic geometric lemma: 

Proposition 3.1. Consider the diagram 

X 




Y 

of affine varieties over an algebraically closed field k. IfY is normal, f is 
generically smooth and surjective, and g factors through f at the level of 
closed points, then g factors through f as a morphism of algebraic varieties. 

Proof. Let A, B, and C denote the coordinate rings of X, Y, and Z re- 



spectively. They are Jacobson rings |EGA IV3, 10.4.6], so that every non 



empty locally closed subset of X, Y, or Z contains at least one closed point 
|EGA IV 3 [ 10.1.2]. Let K and L denote the function fields of X and Y 
respectively. We have the diagram 



Surjectivity of / implies that every prime ideal of B contains kercftf, and as 
B is an integral domain, this means that <pj is injective. We regard B as a 
subring of A via cpj. Our goal is to prove that 4> g {C) C B. Suppose not. 
Let h £ A denote an element of <p g {C) \ B. Thus / factors through 

fh : Spec B[h] — ► Spec B. 

We consider four cases: 

(1) L(h) = L. 

(2) L(h) is an inseparable extension of L. 

(3) L(h) is a non-trivial separable extension of L. 

(4) L(h) is a transcendental extension of L. 
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In case (1), as B is normal and h £ L\B, there exists a prime ideal p of 
B of height 1 such that h Bp \Ma\ Th. 38]. In other words, p does not lie 
in the image of fh, contrary to the surjectivity hypothesis on /. 

In case (2), the extension L — > K is inseparable, contrary to the hypothesis 
that / is generically smooth. 

In case (3), the morphism X — ► Spec B[h] is dominant (since B[h] C j4). 
By |EGA IV4I 17.6.1], there exists a dense open set U± C Spec B[h] for which 
the morphism fh is etale and by |EGA IVil 1-8.4], a dense open set U2 C 

C 

Spec B[h] contained in the image of X in Spec B[h\. Thus fh(UfU U!j/) 
is a non-empty open set in Spec B over which is etale and every point 
in Spec B[h] lies in the image of X. By E GA IVat 15.5.9], by passing to 



a smaller open set if necessary, the cardinality of every fiber of fh equals 
[L(h) : L\. It follows that there exist closed points xi,£2 £ X with the 
same image in Spec B but distinct images in Spec B[h}. If mi and rri2 are 
the corresponding maximal ideals of A, then h maps to different elements 
in A /mi = k and A/m2 = k. Adding a suitable element of k to h, we 
may assume that h maps to in A/ mi but not in A/m2 and therefore some 
element of C maps to mi but not to m2- It follows that xi and %2 have 
distinct images in Spec C, contrary to hypothesis. 

The argument is the same in case (4), except that Spec B[h] — > Spec B 
has the property that the inverse image of every point in Spec B is infinite. 

□ 

Proposition 3.2. Let G be a simply connected, almost simple algebraic 
group over an algebraically closed field k. If X C G is a conjugacy class of 
positive dimension, then for all sufficiently large integers n, the morphism 
7r n : X n — > G obtained by multiplying coordinates in G is generically smooth 
and surjective. 

Proof. Let Xi denote the Zariski closure of 7Tj(X*). As X 1 is irreducible, 
X{ is the closure of a single point and therefore irreducible. The sequence 
dimXi, dim X2, ... is non-decreasing and must therefore stabilize, so we may 
assume dimX„ = dimX n+ i = • • • . If xi,X2 £ X(k), then 

xiX n U x 2 X n C X n+ i, 

so xiX n = X2X n , and 

(3.1) x^ l x 2 X n = X n . 

Let Yi denote the Zariski-closure of 4>i{X 2t ) (or, equivalently, the closure 

of cf)i(X{k) 2i ), where 

4>i(xi, ... , X 2 i) = X^ 1 X 2 X^ 1 X 4 ■ ■ ■ X^l_ x X2i- 

If n is sufficiently large, dim = diml^+i, so 

x l X'iXm = 2-1 x 2^rai 

or 
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for all X2,x' 2 S X(k). This implies that Y m Y m = Y m and therefore Y m is a 
connected algebraic subgroup of G. As X is conjugation- invariant, Y m is a 
normal subgroup, and it follows that Y m = G. By (13. ip . 

A n — Y rn X n — GX n — G . 

Let denote the Zariski closure of the complement of TTi(X l ). For i> n, 
Z{ is a proper closed subvariety of G. If x\,Xi £ X(k), then 

Z n+ i dx\Z n V\ x 2 Z n . 

If xiZ n = x<iZ n for all x±,X2 € X(k), then Z n is invariant under translation 
by Y m = G, which is impossible. Thus x\Z n ^ X2Z n for some x\,X2 G 
X(k). If we form the vector of dimensions of irreducible components of Z n , 
arranged from largest to smallest, then the vector of Z n majorizes that of 
Z n +i- We conclude that Z n = for n 3> 0, and this gives surjectivity. 

For generic smoothness, it suffices to find points x\, . . . ,x n £ X(k) such 
that the map G n — > G given by 

(ffl, • • • ,9n) ^ ^lZlSr^^&T 1 " ■■9nXn9n l 

is smooth at the identity in G n . Equivalently, we must prove that the space 

L(xt, ...,x n ):=(l- ad(xi))(g) + ad(xi)(l - ad(x 2 ))(fl) H 

+ ad(xi • • • x n _i)(l - ad(x n ))(s) 

equals g for suitable x±, . . . ,x n . As 

L(x 1 , . . . , x n ) = (1 - ad(si))(fl) + ad(x 1 )(L(x 2 , . • • , x n ))> 

we have 

dimL(xi, . . . , x n ) > dimL(x2, . . . , x n ) 
with equality if and only if 

(ad^ 1 ) - l)(g) C L(x 2 , x n ). 

Now, 

S x ■= Span zeX (ad(x _1 ) - l)(g) 
is invariant under the adjoint action of G. As x is not in the center of G, 
Sx 7^ 0. If the adjoint representation of g is irreducible, which is usually 
the case since G is almost simple, it follows that Sx = 0- To see that this 
is true in general, we observe that since every element of G is a product of 
elements of X, 

S x = Span 9gG (ad(sO - l)(g), 

and G acts trivially on g/Sx- As G is simply connected, there is no non- 
trivial quotient of g on which G acts trivially (see, e.g., [E] 1-H])- 

Therefore, the dimension determined by a sequence in X can always be 
increased by prepending a suitable element of X unless it already equals 
dimg. The proposition follows. □ 

We can now prove our basic irreducibility theorem: 
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Theorem 3.3. Let w\ and wi denote non-trivial words in m and ri2 letters 
respectively, and letw £ F ni + n2 denote their juxtaposition. Let G be a simply 
connected almost simple algebraic group over an algebraically closed field k. 
Then for all non-central elements g £ G(k), the fiber ir^ 1 ^) is irreducible. 



We remark that the hypothesis that G is simply connected is really 



needed. Otherwise, we can take w± = x\X2X l x 2 and u>2 = x^X4X 3 
and the resulting morphism ir w factors through the universal cover of G. 



Proof. We can express the fiber ir~ 1 (g) as the fiber product over G of tt Wi and 
a second morphism, namely the composition of tt W2 with the "reflection" map 
x i — > gx^ 1 . We will prove that this fiber product is geometrically irreducible. 

Let A denote the coordinate ring of G and K the field of fractions of A. 
Let Ki denote the separable closure of K in the fraction field of A® ni , where 
the morphism A — ► A® ni is that associated to ir Wi . Let Li be the splitting 
field of Ki over K and A, : K — > Li the natural inclusion map. 

For any g £ G(k), we define reflection, conjugation, and translation maps 
G — > G as follows: 



Pg\ 



gx 



-i 



Xg(x) = gxg \ T g (x) = gx. 



The induced automorphisms of A and K are also denoted p g , Xgi an d t 5 
respectively. Let 



X 9 ,i(xi,...,x ni ) = (gxi 



,gx n ,g 



We claim that if g does not lie in the center of G(k), then Ai and \20pg give 
linearly disjoint finite extensions of K. As A® ni is geometrically irreducible 
over Ki |EGA IV2, 4.5.10] and fiber products of geometrically irreducible 
schemes over a field are again geometrically irreducible |E GA FV^t 4.5.8], this 
claim implies the theorem. As L\ and L2 are Galois over K, it is equivalent 
to prove that there do not exist subfields K\ and K2 of L\ and L2, properly 
containing K, such that K\ and K2 are isomorphic as /C-extensions. 

Suppose, on the contrary, that such an isomorphism l: K\ — » K2 exists. 
The injective homomorphisms A« factor through Ki, and we write re, for 
the corresponding homomorphisms K ^ Ki. Then we have the following 
diagram, in which the horizontal maps are isomorphisms and the vertical 
maps are finite field extensions: 



Xh,l ~ , ~ Xh.2 _ /,-! 

K\ K x K 2 K 2 K 



K 



Kl 


Kl 


12 


12 










r Xh 


, p 3 




, e* 1 r 



K 



K 



K 



1 



K. 
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The composition of horizontal arrows gives a diagram 



K 



hgh 1 g 



K, 



where a g ^ is an isomorphism depending on g and h. 

Let A denote the integral closure of A in K\. As A is finitely generated 
over the field K, it is a Japanese ring [EGA IV2 , 7.7.4], so A is a finitely gen- 
erated ^4-module. The map Ki restricts to an injective ring homomorphism 
A — > A; by a slight abuse of notation, we denote by K\ also the correspond- 
ing homomorphism of affine varieties G —* G, where G = Spec A. We fix 
e 6 G(fe) lying over the identity in G. For every /ii, /12 £ G, <r s ^ and X/i 2 ,l 
induce automorphisms of G. Fix h\ which does not commute with g and 
let \i2 vary over G. By taking the commutator of <r g ji\ an d Xh 2 ,i we obtain 
a morphism S:IxG^G, where X is the conjugacy class of high^ 1 g^ 1 , 
and each point of X gives an automorphism of G which covers the corre- 
sponding translation of G. We write T, n for the morphism X n x G — > G 
defined recursively by 



S n (xi,... ,x n ,y) = S(xi,S„_i(x 2 ,... ,x n ,y)), 
for n > 2 and Si = S. Let S„ )e : X n — > G be defined by 

Sn,e(-£l; • • • > Xn) — . . . , X n , e). 



The composition S nje oKi gives the restriction to X n of the usual n-fold 
multiplication morphism G n — * G. 

By Proposition 13.21 the composition of X njf > : X n — > G and G — > G is 
surjective and generically smooth for all sufficiently large integers n. It 
follows that S n)e itself is generically smooth and dominant. If the Zariski- 
closure Z n of the complement of the image of S nje is non-empty, then there 
exist xi,X2 G X(k) such that Z n ) 7^ £(#2, ^n) (because the images of 

these two subvarieties of G in G are distinct). As 

z n+ i c z n ) n z n ), 

we see the vector of component dimensions of Z n majorizes that of Z n . For 
n sufficiently large, therefore, Z n is empty, i.e., S„ )e is surjective. 
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We would like to apply Proposition 13.11 to the upper triangle of the dia- 
gram 

(3.2) 




We observe first that the pentagon of solid arrows in (|3.2p commutes because 
the diagram 



X x G 



idx/ti 



X x G 



G 



Kl 



G 



commutes. The variety G x G is normal since G is so by construction. To 
check that there exists a morphism jl at the level of closed points, we note 
that if x\, . . . , x n belong to X(k), the resulting map £ n (xi, . . . , x n , •) is an 
automorphism of G which covers translation by x\ ■ ■ ■ x n . Our claim asserts 
that, at least at the level of closed points, this automorphism depends only 
on S nje (xi, . . . , x n ) and not on the actual n-tuple (xi, . . . , x n ). Indeed, if we 
fix g £ G and let g = Ki(g) there is a unique automorphism of G covering 
T g and sending e to g. Thus, we may apply Proposition 13.11 to define jl. 
The resulting diagram commutes (i.e., the lower rectangle does so) because 
it commutes at the level of closed points. 

We claim that the arrow fx in f|3.2|) is a multiplication morphism making 
(G, e) into an algebraic group and G — > G into a central isogeny. As G is 
simply connected, this implies that k\ is an isomorphism |Ti| 1.5.4], and 
therefore that K = Ki, which proves the theorem. 

To prove that jl satisfies the associativity axiom, we compare the two 
maps G 3 —> G given by ji(jj,(x , y) , z) and jl{x, jl(y, z)). We have a diagram 




The closed subvariety of G 3 on which the two arrows agree is also open since 
the fibers of G — > G are finite. It is non-empty since it contains (e, e, e). As 
Cr is connected, jl is associative. 

Next we construct the inverse on G. We choose an automorphism of G 
which covers the composition r^-iop^ (which maps x i— > x^ 1 ). This auto- 
morphism sends e to some element of G lying over the identity of G. By 
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composing with a deck transformation, we can find a new automorphism of 
G which sends e to itself and covers the inverse map on G. Again we use 
connectedness to show that this morphism satisfies the diagram for inverse 
maps with respect to fi. Thus (G, £l, e) is an algebraic group and the cov- 
ering map G — > G is a surjective homomorphism, i.e., an isogeny. As it is 
separable, it is central. The theorem follows. 

□ 

We can now deduce Theorem 11.61 

Proof. As in the proof of Proposition 12.14 it suffices to consider a fixed 
Dynkin diagram A and a group scheme tt: Q — > S such that every simply 
connected, almost simple algebraic group G/F q with Dynkin diagram A is 
isomorphic to Q s for some s £ S(¥ q ). Next we consider the morphism 

tt w : G ni+n2 -> Q 

(where the fiber power is taken relative to S) given by a word uu which 
is a juxtaposition of the words w\ and u>2 taken in disjoint variables. If 
s £ S(¥ q ) and x G G s (¥ q ) is a non-central element, then by Theorem 13.31 
the fiber of ir w over s is geometrically connected, so by Lemma [2. 2 \ if q 3> 0, 
x £ w(Q s (¥ q )). If T denotes the quotient of G s (¥ q ) by its center, it follows 
that the image of x in T lies in w(T). This accounts for all elements of T 
except the identity, which trivially lies in w(T). 

□ 

4. Special word values in SL 2 (F p ) 

In this section we take a closer look at 2-dimensional special linear groups, 
which play a key role in our results for alternating groups. Our main result 
below proves the existence of elements of very high order in iu(SL2(F p )), 
under certain conditions on p. 

Theorem 4.1. For every non-trivial word w there exist constants M w and 
N w such that for every prime p > N w such that p — 1 is not divisible by 
4 nor by any prime 3 < I < M w , there exists an element of order in 
w(SL 2 (¥ p )). 

Proof. Our main goal is to find s £ SL^Fp) satisfying the following condi- 
tions: 

• There exists t G SL2(¥ p ) d such that ir w (t) = s. 

• The centralizer of s is a split torus. 

• There are two square roots of s in SL^Fp) 

• If k > 1 is not divisible by any prime < M w , then s has at most 
k — 1 fcth roots in SL^Fp). 

Let Pk denote the fcth Chebychev polynomial, normalized so that 

P k (x + x" 1 ) = x k + x~ k . 
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Note that Pk{x) G Z[x]. If A € F p \ {0} and k is an integer such that 
X k / A~ fc , then the conditions {A fc , A + A -1 } C ¥ p and A G F p are equiva- 
lent. Therefore, the specified conditions on s are equivalent to the following 
conditions on the trace u = tr(s): 

• The equation tr(7r w (t)) = u has at least one solution t £ SL^Fj,)^. 

• The equation x + x^ 1 = u has two solutions in F p . 

• The equation P2{x) = u has two solutions in ¥ p . 

• If k > 1 is not divisible by any prime < M w , then the equation 



Pk( x ) = u has at most k 



1 solutions x € F p . 



Let 



SL 2 = Spec Z[au, ai2, 021, 022]/(ona22 - 012021 - 1) 



and A 1 = Spec Z[x}. Let X\ denote the integral closure of A 1 in SL^, 
and write ip : SLf[ — > X\ and X\ — > A 1 for the natural maps whose 
composition r : SL2 — > A 1 gives the trace of the morphism tt w associated to 
w: 




A 1 . 



The generic fiber of is geometrically connected. By the openness of geo- 
metric connectivity [EG A IV^ 9.7.7], there exists a non-empty subscheme 
U C A 1 such that V'lr- ^rH^O has geometrically connected fibers. 
Let 

X 2 = Spec Z[x, y]/(y 2 - x 2 + 4), 



and let (p2 : X% — > A 1 denote the map (x, y) 1— > x. Let ^3 = A 1 for all 



positive integers k, and 



4f>. 



: X 



A 1 be the Chebychev morphism given 



by Pfc. The schemes X\, X2, and X^' are generically finite over A 1 , so 
shrinking IA if necessary we may assume that over U, all three are finite 
etale. Let M w denote the degree of 4>\. If N w is sufficiently large, for every 
prime p > N w , the F p -fiber of IA is non-empty. For some L w independent of 
p, for every p > N w , ¥ p \U(¥ p ) has at most L w points. 

(k) 

We now fix p sufficiently large and let Ui, U2, and C/3 denote the inverse 

(k) 

images of the F p -fiber lAf p in Xif p , X^ p , and X^ ¥ respectively. 
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We define the various fiber products of these curves as follows: 



U 



123 



Ul 




23 



(k) 



(k) 



We now rewrite our original set of conditions on u G F p (which we now 
assume lies in the subset U(F P )): 

• There exists u\ G U\{¥ p ) mapping to u. 

• There exists U2 G ^(Fp) mapping to u. 

(2) (2) 

• There exists u 3 £ (F p ) mapping to u. 

• If k > 1 is not divisible by any prime < M w , the number of points 

G [/^(Fp) mapping to u is less than k. 

The first condition may appear to be weaker than first of the previous 
set of conditions on u, but for p » 0, they are equivalent. This is because 
the fibers of ip over all points in Ui(¥ p ) are geometrically connected, so by 
Lemma 12.21 they all have F p -points when p > 0. The second and third 
conditions mention only one preimage instead of two, but for two-to-one 
finite etale maps, there is no difference. 

We now suppose that k is odd and 2k divides p — 1. By the Riemann- 
Hurwitz theorem, Uj^s is an open subset (the complement of at most 8kL w 
points) of a curve whose genus is 0{k). (Here we use the fact that the 
etale morphism U-j^, -^U\% must extend to a tamely ramified cover of non- 
singular projective curves because its degree is less than p.) By the Weil 
bound for curves, (F p ) has p + 0(ky/p) points. As 2k divides p — 1, a 
regular semisimple element in a split maximal torus of SL2(F p ) which admits 

(2k) 

one 2/cth root admits 2k. (In terms of our diagram, is Galois over U\2-) 
Therefore, the image of Uy£$ (F p ) in [/-[^(Fp) has ? + 0{^/p) points. 

(2) 

The total number of points in (F p ) is p + 0(^/p). Each maps to 
u & IA (F p ) which is the trace of an element of the form x 2 in a split torus of 
SL2(Fp). This element is the rth power of a generator of the split torus for 
some r dividing p — 1 . We assume p = 3 (mod 4) , so the highest power of 2 
dividing r is 2. We would like to show that x can be chosen so that r = 2. 
It suffices to show that for every odd divisor k of p— 1, x is not a kth. power. 

(2) 

By hypothesis, k has no prime divisors < M w , so every point on t^ 12 3(Fp) 
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for which the claim is false belongs to Ik- Now, 

£ = 2*(^) + o(r(^)v^), 

where p, (ft, and r denote the Mobius, Euler, and number of divisors func- 
tions. As r(n) = o(n e ) for all e > 0, and <f){n) > n 1 ^ 6 for all large n, this 

(2) 

implies that points of the desired form exist. Let be one such, and 
let u be its image in U{¥ p ). Then u satisfies all the conditions needed to 
guarantee that it is the trace of an element of u;(SL/2(F p )) of order 2^-. 

□ 

5. Alternating groups, I: three prime theorems 

In this section we prove Theorems 11.31 and 11.91 The proof combines our 
result on SL 2 (Fp) from the previous section, with a certain approximation 
to the Goldbach conjecture in the form of a three prime theorem. 

To explain the idea, note that SL 2 (F p ) embeds in A p +\, and an element 
of order (p — l)/2 in SL 2 (F p ) has two non-trivial cycles and two fixed points 
in this embedding. Thus, if n = p + 1 and p satisfies the conditions of 
Theorem 14. 1( then we can produce a G w(A n ) with cyc(cr) = 4 (and with 
two fixed points). 

To extend this idea for arbitrary n we need to find suitable primes pi,P2, P3 
such that n = (pi + 1) + (p2 + 1) + (p3 + 1) + c where c is a small constant, 
and then to embed SL 2 (F pi ) x SL 2 (F P2 ) x SL 2 (Fp 3 ) in A n , which produces 
a permutation with a bounded number of cycles. Moreover, results from 
Additive Number Theory on the number of ways to write n in the form 
above will provide us with information on the size of w(A n ). 

We now give the detailed proof of Theorems 11.31 and 11.91 

Proof. We are given a non-identity word w. By Theorem 14. 1[ for every 
prime p > N w such that p = 3 (mod 4) and p — 1 not divisible by any odd 
prime < M w , there exist z Pt \, . . . , z p ^ £ SL 2 (F p ) such that w(z Pi i, . . . , z Pt d) 
is of order The action of SL^Fp) on F 1 (F p ) determines an injective 

homomorphism SL^Fp) — ► A p+ \. Let denote the image of z p ^. Thus 
w(C p ,i, • • • , Cp,d) consists of two ^= — cycles and two fixed points. 

Next we claim that for every integer N sufficiently large in terms of M = 
M w , there exists an integer N' in the interval [N — 11, N] such that N' — 3 is 
the sum of three primes pi such that p, > N w , pi = 3 (mod 4), and pj — 1 is 
not divisible by any prime in < M w . We do this by applying Ayoub's version 
of Vinogradov's three prime theorem |Ay| , asserting that every sufficiently 
large integer N can be written in at least cN 2 log -3 ./V ways as a sum of 
three primes lying in any specified residue classes a\, a2, (mod m) as 
long as iV is congruent to a\ + a 2 + 03 (mod m). For sufficiently large, 
this is enough to ensure that pi, P2, P3 can be chosen to be larger than N w . 
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We choose N' = 12[-/V/12j so that N' — 3 is the sum of three primes each 
congruent to 11 (mod 12). If pi is such a prime, p% — 1 is not divisible by 3. 
For every prime I greater than 3, every residue class (mod £) can be written 
as the sum of three residue classes from 2 to i— 1, so we can impose (mod £) 
conditions on pi, p2, P3 consistent with £ \ p% — 1 and p\ + P2 + P3 + 3 = N' 
(mod £). By the Chinese remainder theorem and Ayoub's result, the element 

w ((Cpi,l 5 Cp 2 ,l: Cp 3 ,l); • • • (Cpi.dj Cp 2 ,rf) (p 3 ,d)) e Api+1 x -4p 2 +l x -4p3+1 C 4lV' 

has at most 12 cycles including 6 fixed points, so by the inclusion Ajv' C An, 
there is an element in w(An) with at most 23 cycles, of which at most 6 are 
non-trivial. We can construct a sequence of elements on G ^4 at in this way 
for which the bound M tends to oo as N goes to do. 
This concludes the proof of Theorem 11.31 

For Theorem 1 1.9} we construct elements as above. The centralizer of such 
an element in A n has order at most 

17 >1+1) 2 (P2 + 1) 2 (P3 + 1) 2 = Q( 6) 

16 1 ' 

Therefore, the conjugacy class of such an element has 0(n _6 |^4 n |) elements. 
As we have 0(n 2 ~ e ) ways of choosing triples (pi,P2,P3), m all there are at 
least 0(n~ 4 ~ e \A n \) elements in w(A n ). 

□ 



6. Alternating groups, II: a combinatorial construction 

In general it is difficult to say when a given conjugacy class in a sym- 
metric group appears in the product of two other conjugacy classes. In this 
section we present an explicit construction that can be used to show that 
certain conjugacy classes are contained in the squares of other classes. This 
constitutes the elementary part of the proof of Theorem 11.11 

We denote the number of fixed points of a permutation a G S n by fix(cr) . 

Proposition 6.1. Let n be a positive integer and a.,/3 E S n permutations 
belonging to conjugacy classes A and B respectively. If 

fix(/3) > 7cyc(a), 

and P is even, then B C A 2 . 

Proof. Let [a, b] denote the sequence of consecutive integers a < x < b; such 
a sequence will be called an interval of length 1 + b — a. Given sequences 
Si, . . . , S r , let j(Si, . . . , S r ) denote their concatenation. If S is a sequence 
and k is an integer, Tk(S) is the sequence obtained by adding k to each 
element of S. If S is a fc-term sequence of distinct integers in [l,n], let (S) 
denote the corresponding /c-cycle in S n . Let = ao < ai < . . . < a m = n be 
defined so that 



7 = ([o + 1, ai])([ai + 1, a 2 ]) • • • {[a m -i + 1, a m ]) G A. 
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We want to find 5 6 A such that j5 £ B. 

Let 6fc denote the number of fc-cycles in (5. Let 

ci = b\ = fix(/3) > 7cyc(a) = 7m, 

C2 = &2, 

C3 = h + b 5 + b 7 -\ , 

c 4 = 64 + 6 6 + 6 8 H , 

d3 = C3, 

d 4 = Lc 2 /2J , 

^6 = C2 - 2(i 4 , 

4 = Lc 4 /2J. 

As 62 + 64 + + • • • is even, so is C2 + C4, so C2 and C4 are both even or both 
odd and C4 = 2ds + • 

Our first task is a packing problem: to find a set S consisting of dj intervals 
of length i for i = 3, 4, 6, 8, each contained in some [a k + 1, a k+ i], such that 
the resulting intervals are mutually disjoint. To show that such a packing 
is possible, we iterate, starting from k = and packing as many intervals 
of length i £ {3,4,6,8} as possible (up to a limit of di intervals of length 
i) into each interval [a k + 1, a k +i] in turn, starting from the right endpoint 
and working toward the left, so the union of intervals at any point in the 
construction is "right-justified." For each k, we denote by Ik,i,Ik,2, ■ ■ ■ 1 Ik,p k 
the successive elements of E in [a^ + 1, a>k+i] 

If the packing process terminates before all the upper limits di have been 
achieved, then the number of free spaces in all the [a^ + l,afc +1 ] combined 
is < 7m. Since 61 = fix(/3) > 7m, we have 

3d 3 + 4eZ 4 + 6<2 6 + 8dg = 2c 2 + 3c 3 + 4c 4 < b 2 + 63 + b 4 H < n - 7m, 

so 

\{(k,l): \I k ,i\ =i}\ =dt 

for i = 3,4,6,8. 

We define Ij. so that for each k from to m — 1, the union of the intervals 
4,1, 4,2, • • • , 4, Pfc is [h + !, Ofc+i]. We write e kji for the left endpoint of I k>i 
and define 

fr eM (2,0,l) |4,i| = 3 

^(3,0,1,2) |4,i|=4 
T eM (5,2,0,3,4,l) |4,il = 6 

T efc> . (7, 5, 2, 0,3, 6, 4,1) |/ fc)i | = 8 

Note in particular that the first term of S k ,i is the last term of 4,i- If 
|4,i| = 3 (resp. |4,i| = 6), we say that e^^ + 1 is a special point of odd 
type (resp. even type). Intervals of length 4 in S have no special points. If 
1 4,i I = 8, there are two special points of even type in the interval: e k; i + 1 
and e k i + 6. We assign each special point a label, i.e., a positive integer A, 



(6.1) S k , t 
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in such a way that there are &2A+1 special points of odd type with label A 
and &2A+2 special points of even type with label A. 

Let Xk denote the interval [at + 1, Ik] and Yj, the subset of consisting 
of ak + x such that x is odd and + x + 1 < Ik- Thus \Xk\ < 2|lfc| + 1. 
Setting 

x = x u x 1 u • • • u x m _i, y = y u y u • • • u y m _i, 

we have |X| < 2|y| + m. Thus, 

. , ^ n — 3d 3 — 4^4 — 6c?6 — 8c?8 — m n — 2c2 — 3c3 — 4c4 — m 

' ' - 2 ~~ 2 

(6.2) 1 1 00 . 00 . 

i=5 i=5 

There exists a permutation e £ S n with the following properties: 

• Every cycle of length > 2 contains exactly one element of [1, n] \ Y, 
and this element is special. 

• The length of a cycle containing a special point of label A is A. 



The existence of e follows from (|6.2p . 
For < k < m, we define a cycle 

&k = (j(Sk,i, • • • , Sk, Pk ,Rk)), 

where is the reverse of the sequence [o^ + Thus, SqSi •••<5 m _i 

belongs to A. We define 

5 = eS ■ ■ ■ (5 m __ie _1 G A. 

We claim 76 belongs to B, i.e., that it has exactly bj j-cycles for each j. As 
Y^j jbj = n, it suffices to prove that there are at least bj j-cycles for each j. 
There are four cases to check: j = 1, j = 2, j > 3 odd, and j > 4 even. 

For j = 1, we note that every element of x G Rk such that x and x — 1 
are fixed by e satisfies 7<5(x) = 7(2; — 1) = x. Thus, 



> n — 


E 






k,i 




= n — 




-4d 4 - 


= n — 


2c 2 


- 3c 3 - 


= n — 


26 2 


- 36 3 - 


= n — 


26 2 


-36 3 - 


= i»i. 







% - 4b 6 2|y \ fix(e) 
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For j = 2 we consider all I k) i €. £ which are of length 4. For each such 
interval, 7(5 maps 

ek,i ^ 7^fc(efc,i) = l{dk,i + 1) = e k ,i + 2 
&k,i + 1 •-»■ 7^fc(efc,j + 1) = 7( e fe,« + 2 ) = e fc,i + 3 
efc,, + 2^ 7<5fc(efc,i + 2) = 7(e fe)i - 1) = e k>i 
e k ,i + 3^ l5 k (e k ,i + 3) = 7(e fe)i ) = e fe ,, + 1. 



This produces a total of 2d,4 2-cycles. If there is an interval I k) i of length 6, 
then 7<5 maps 

e k ,i + 5 i-> 7(5 fc (e fci j + 5) = 7(e fc)i + 2) = e fe)i + 3 
efc,i + 3^ -yS k (e k ,i + 3) = 7(e fc)i + 4) = e fe)i + 5. 



Thus, 7(5 has at least 2c?4 + ^ = C2 = 62 cycles of length 2. 

If j > 3 is odd, let A = (j — l)/2. There are 6j special points of odd type 
with label A. Each belongs to an interval I kt i of length 3. The special point 
in such an interval is e k ,i + 1. Thus 7<5 maps 

e k ,i "-»■ e(ejfc,i + 1) + 1 
e r (e fc)i + 1) + 1h» e r+1 (e fcii + 1) + 1, 1 < r < A 
e k ,i + 2^ e fcji + 1 
e r+1 {e Ki + 1) ^ e r (e fcii + 1), 1 < r < A 
e(ejfc,i + 1) >-> e k ,i 



producing a j-cycle. It follows that 7(5 has at least bj j-cycles. 

If 7 > 4 is even, let A = j/2 — 1. There are special points of even type 
with label A. Each special point with label A in an interval I k ^ of length 8 
contributes a j-cycle because jS maps 

e k ,i ^ e k>i + 4 
e k ,i + 4^ e(e fcji + 1) + 1 
e r {e Kt + 1) + 1h» e r+1 (e M + 1) + 1, 1 < r < A 
efc,, + 2^ e k ,i + 1 
e r+1 (e fc ,i + 1) ^ e r (e fc>i + 1), 1 < r < A 
e(efc,j + 1) i-> e k ,i 
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and 

dk,i + 3 i-> e(e fc)f + 6) + 1 
e r (e fc , 4 + 6) + 1 ^ e r+1 (e M + 6) + 1, 1 < r < A 
efc,i + 7 h-> e kji + 6 
e r+1 (e M + 6) ^ e r (e M + 6), 1 < r < A 
e(efc,j + 6) i-> e fc)f + 5 
ek,i + 5 h-> efc,i + 3. 

Likewise, if ^, of length 6, has a special point with label A, then j5 maps 

e-k,i ^ efe.i + 4 
efc,j + 4^ e(e k ,i + 1) + 1 

e r (e M + + e r+1 (e fc ,i + 1) + 1, 1 < r < A 
efc,i + 2h> e fcji + 1 
e r+1 (e kA + l)^e r (e Ki + l), 1 < r < A 
e(ejfc,» + 1) i->- e fc)i . 

Thus, again we have at least bj j-cycles, and this finishes the proof. □ 



7. Alternating groups, III: characters and probability 

Here we prove a character bound for symmetric groups which may have 
some independent interest, and combine it with Proposition 16,11 and a prob- 
abilistic argument to prove that conjugacy classes of permutations which 
contain a small number of cycles have the property that their squares cover 
all of A n . 

This proves Theorem 11.11 and answers an old question of J. L. Brenner 
(but falls short of his strong conjecture [Br]). For our purposes, however, 
it is more than enough; combining it with Theorem 11.31 we deduce Theo- 
rem [T3J giving a best possible solution for Waring's problem for A n . This 
will complete the proof of all results stated in the Introduction. 

Recall that the irreducible characters x £ I rr {S n ) are parameterized by 
partitions A of n, which correspond to Young diagram. Let x\ De t ne char- 
acter corresponding to a diagram A. 

We define layers of a diagram A as follows. The first layer consists of 
the first row and column of A. Once we remove the first layer we obtain a 
smaller diagram A'. The layers of A are defined recursively as the layers of 
A' together with the first layer of A. 

We shall use the Murnaghan-Nakayama Rule |Jal 21.1]. By a rim r- 
hook v in a A-diagram, we mean a connected part of the rim containing r 
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nodes, which can be removed to leave a proper diagram, denoted by X\u. 
If, moving from right to left, the rim hook v starts in row i and finishes 
in column j, then the leg-length l{v) is defined to be the number of nodes 
below the ij-node in the A-diagram. 

Proposition 7.1. (Murnaghan-Nakayama Rule) Let pir £ S n , where p is 
an r- cycle and ir is a permutation of the remaining n — r points. Then 

XA(pvr) = ]T(-l)'H XAV ,(vr), 

where the sum is over all rim r-hooks v in a X-diagram. 

We shall now bound the character values of a permutation as a function 
of its number of cycles alone. 

Theorem 7.2. Let a € S n be a permutation with k cycles (including 1- 
cycles). Then 

\ X {o-)\<2 k - l k\ 
for all irreducible characters x of S n . 

Proof. We argue by induction on k. For k = 1 our permutation a is an n- 
cycle, and it is a well known consequence of Proposition 17. II that Ixt^)! ^ lj 
and moreover, if x(°~) then the Young diagram of x consists of a single 
layer. 

Suppose now that k > 2 and let X = Xx- Write a = pir where p is an 
r-cycle and n is a permutation on the remaining n — r points. We now apply 
the Murnaghan-Nakayama Rule above. Since ir decomposes into k—1 cycles 
induction yields 

IxavOOI <2 k - 2 {k-l)\ 

for all rim r-hooks v in the diagram A. 

To complete the proof we may assume X\{°~) 7^ 0- 

Claim: The number rim r-hooks in A is at most 2k. 

To show this first note that, by repeatedly applying the Murnaghan- 
Nakayama Rule, we see that A has at most k layers (otherwise Xa(°") = 0). 

Now, each rim r-hook v of A has a starting point, its rightmost upmost 
point, and an end point, its leftmost downmost point. Each of these points 
determines the rim r-hook uniquely (by going r steps on the boundary in 
the suitable direction). We define the starting point and the end point of 
each layer of the diagram A in a similar manner. 

Since the remaining part \\v of the diagram should be connected, either 
the starting point of v is the starting point of some layer of A, or the end 
point of v is the end point of some layer of A. Since there are at most k 
layers, it now follows that the number of rim r-hooks in A is at most 2k, 
proving the claim. 
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The Murnaghan-Nakayama Rule now expresses Xa(c) as a sum of at most 
2k terms, each of absolute value at most 2 k ~ 2 (k — 1)!. This yields 

IxaWI <2k-2 k ~ 2 (k -l)\ = 2 k ~ l k\, 
completing the proof of the theorem. 

□ 

Note that |x(°")| may be close to (kl) 1 ^ 2 , for instance take a = 1 (so k = n) 
and x a character of highest degree. This shows that the upper bound in 
Theorem 17.21 cannot be significantly improved, at least for large k. 

Given conjugacy classes Cj = xf (i = 1, 2) in a finite group G, let Pc 1; C 2 
denote the distribution of the random variable y = yiy2, where yi £ Ci is 
randomly chosen with uniform distribution on Cj. The following is standard. 

Lemma 7.3. With the above notation we have, for g £ G, 

p < n \ in-i V" x{xi)x{x 2 )x{g' 1 ) 

Pci - C2(5) = |G| j- (G) — m — • 

Xelrr(G) 

For a finite group G define 

Cg(s)= Y, XW~ S 

where s is a real number. 

We need the following two results from |LiSh2j . related to character de- 
grees. 

Lemma 7.4. For s > we have Cs„( s ) = 2 + 0(n~ s ). 

Lemma 7.5. Let A = Ai, . . . , A m be a partition of n such that Ai > m. Let 
t = n — Ai . Then 

r»;xA(i)>cr*)- 

(ii) If t > en for some e > then Xa(1) — c " where c > 1 depends on e. 

We can now prove Theorem ll.il 

Proof. Throughout this proof, we assume without further comment that n 
is sufficiently large. Let a be a permutation in S n with 

cyc(a) < n 1 / 128 . 

We have to show that (a Sn ) 2 = A n . 
We first claim that 

(7-1) \ X (a)\ <x(l) 1/63 , 

for all irreducible characters x of S^. 

This is clear if x(l) = 1 so suppose x = Xa is a non-linear character, 
where A = Ai, . . . , A m is a partition. Since we are only interested in absolute 
values, we may replace a partition by its transpose and assume that Ai > m 
(so Lemma 1731 is applicable). 
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Write X\ = n — t and k = cyc{a). Then Theorem 17.21 and our assumption 
on k yield 

(7.2) |XA(a)|<2^ 1 A:!<(n 1 / 128 r 1/128 . 

If t > re/3, Lemma 17311 1) gives an exponential lower bound on xa(1) ; so 
CLU) follows. 

If n 1 / 127 <t< n/3, then Lemma l7.5( i) yields 



X(l) > ( V) > 2* > 2 



Combining this with (|7.2j) we obtain (|7.ip . 

Finally, suppose 1 < t < n 1 / 127 . Let / denote the number of fixed points 
of a. We use (HSl (14)] to show that 

ixmis* e (0(*;0 sfl ^C)- fl - (t+1) -*'- 

a,6>0,a+2fc<t v 7 v 7 0<i<t v 7 

Using the bounds k < ko : = n 1//128 and t < to : = n 1 / 127 this gives 
IxWI < (fcfto)' < n^s+i/m)^ 
It is easy to see that for i < n 1 / 127 and fixed 5 > we have 

n (l-l/127-5)t < - t N 

Choosing 5 such that 1/128 + 1/127 = (1 - 1/127 - <5) ■ 1/63 and noting that 
5 > it follows that 

1/63 



V) ^ * (i)1/63 - 



This completes the proof of (17.ip . 

Now, let C = o" 5 " and let 7r denote an even permutation. 

If 7T has more than 7n 1 ^ 128 fixed points, then we have tt £ C 2 by Proposi- 
tion [67TJ We therefore assume from now on that fix(7r) < 7n 1 / 128 . By \M.S\ 
Theorem B], this implies 

Wvr- 1 )! < |x(l)| 30/31 - 

By Lemma 17.31 we have 

X^Mrr- 1 ) 



Pc,c 



X€lrr(S„) 



The condition on the sign of tt shows that the contribution of the linear 
characters of S n to Pc,c( 7r ) is 2/n!. On the other hand, 

y x^MO < y (x(i) 1/63 ) 2 x(i) 30/31 = y 
x(l) x(l) 

{X:X(1)>1} AV ^ {X:X(1)>1} AV 7 {X- xM>l} 



28 MICHAEL LARSEN AND ANER SHALEV 

where s = 1/31 — 2/63 > 0. Combining this with Lemma 17.41 we conclude 
that 

\P C> c(*) ~ 2/n!| < -A( Sn (s) - 2) = 0(n- s /n\). 
nl 

This implies Pc,c(^) > 0, so it G C 2 . 
The theorem is proved. 

□ 

Combining various tools above, we can finally prove Theorem 11.51 

Proof. We have to show, given words W\,W2 ^ 1, that wi(A n )w2(A n ) = A n 
for all n > N(wi,W2)- 

We use Theorem ll.3l and its notation. Let N = N(wi,W2) ■= maxj = i 2 N(wi), 
and let n > N. Then Theorem 11.31 shows that there exists a permutation 
a n £ wi(A n ) CiW2(A n ) consisting of at most 23 cycles (of which at least 6 are 
of length 1. so the A n -conjugacy class of a n is the same as its 5 n -conjugacy 
class). Clearly a^ n C w\{A n ) n W2{A n ) and by Theorem ?? we have 

w 1 {A n )w 2 {A n ) D ( Wl (A n ) n w 2 (A n )) 2 D (erf") 2 = A n . 

The theorem is proved. 

□ 



8. Intersection theorems 

Our methods enable us to deduce somewhat surprising results concerning 
intersections of the form n^ =1 Wi(G), where wi,...,Wk are given words. Note 
that in free groups such intersection may well be trivial. However, we can 
prove that in the groups considered in this paper such intersections are very 
large. More precisley, we have 

Theorem 8.1. Let k > 1 and let wi, . . . ,Wk be non-trivial words. 

(i) There exists N = N(w\, . . . , Wk) such that for all n > N we have 

{wi{A n )r\...r\w k {A n )f = A n ; 

(ii) For every e > there exists N = N(w\, . . . ,W}~, e) such that 

\wi{A n ) n . . . n w k {A n )\ > n- 4 - e \A n \. 

Indeed, part (i) follows from Theorems 11.11 and 11.31 using the fact that, 
for n > 0, a** C n^ =1 Wi(A n ). 

The proof of part (ii) is similar to that of Theorem 11.91 

Our next result is an intersection theorem for groups of Lie type. 

Theorem 8.2. Let k > 1 and let w\, . . . ,W}- be non-trivial words. Let G a 
simply connected almost simple algebraic group over a finite field ¥ q . 

(i) There is a positive constant c depending only on w\, . . . ,w k and dim G 
such that 

\wi(G(¥ q )) n . . . n w k (G(¥ q ))\ > c\G(¥ q )\. 
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(ii) There is a number N depending only on w±, . . . ,Wk such that, if G 
has rank r and is not of type A r or 2 A T , and \G(¥ q )\ > N, then 

K(G(F ? )) n . . . n w k (G(¥ q ))\ > cr~ 1 |G(F (? )|, 

where c > is an absolute constant. 

(Hi) There is a number N depending only on w±, . . . , wt and dimG such 
that, ifT = G(¥ q ) /Z(G(¥ q )) is a finite simple group associated with G, and 
\T\ > N, then 

(w 1 (T)n...nw k (T)) 2 = T. 

The proofs of all three statements are modelled on the k = 1 case, replac- 
ing the morphism w : G n — ► G (or in the case of (ii), w: SL?? — ► SL2) with 
the fiber product, relative to G, of the morphisms Wi : G ni — > G (respectively 
W{: SL^ 1 — > SL 2 ). 

References 

[AH] Z. Arad and M. Herzog (Eds), Products of Conjugacy Classes in Groups, 

Springer Lecture Notes 1112, Springer- Verlag, Berlin, 1985. 

[SGA 43] M. Artin, A. Grothendieck and J.L. Verdier, Avec la collaboration de P. 

Deligne et B. Saint-Donat, Theorie des topos et cohomologie etale des schemas. 
Tome 3. Seminaire de Geometrie Algebrique du Bois-Marie 1963-1964, Lec- 
ture Notes in Mathematics, Vol. 305. Springer- Verlag, Berlin-New York, 1973. 

[Ay] R. Ayoub, On Rademacher's extension of the Goldbach-Vinogradoff theorem, 

Trans. Amer. Math. Soc. 74 (1953), 482-491. 

[Bo] E. Bertram, Even permutations as a product of two conjugate cycles, J. Comb. 

Th. Ser. A 12 (1972), 368-380. 

[Bo] A. Borel, On free subgroups of semisimple groups, Enseign. Math. 29 (1983), 

151-164. 

[Br] J.L. Brenner, Covering theorems for finite nonabelian simple groups. IX. How 

the square of a class with two nontrivial orbits in S n covers A n , Ars Combi- 
natona 4 (1977), 151-176. 

[De] P. Deligne, La conjecture de Weil. II. Inst. Hautes Etudes Sci. Publ. Math. 52 

(1980), 137-252. 

[EGH] E.W. Ellers, N. Gordeev and M. Herzog, Covering numbers for Chevalley 

groups, Israel J. Math. Ill (1999), 339-372. 
[EGA IVi] A. Grothendieck, Elements de geometrie algebrique. IV. Etude locale des 

schemas et des morphismes de schemas I, Inst. Hautes Etudes Sci. Publ. Math. 

20 (1964), 259 pp. 

[EGA IV2] A. Grothendieck, Elements de geometrie algebrique. IV. Etude locale des 

schemas et des morphismes de schemas II, Inst. Hautes Etudes Sci. Publ. 

Math. 24 (1965), 231 pp. 
[EGA IV3] A. Grothendieck, Elements de geometrie algebrique. IV. Etude locale des 

schemas et des morphismes de schemas III, Inst. Hautes Etudes Sci. Publ. 

Math. 28 (1966), 255 pp. 
[EGA IV4] A. Grothendieck, Elements de geometrie algebrique. IV. Etude locale des 

schemas et des morphismes de schemas IV, Inst. Hautes Etudes Sci. Publ. 

Math. 32 (1967), 361 pp. 
[ET] P. Erdos and P. Turan, On some problems of a statistical group theory. I, Z. 

Wahrscheinlichkeitstheorie Verw. Gabiete 4 (1965), 175-186. 



30 



MICHAEL LARSEN AND ANER SHALEV 



[Hu] D. Husemoller, Ramified coverings of Riemann surfaces. Duke Math. J. 29 

(1962), 167-174. 

[Ja] G.D. James, The representation theory of the symmetric groups, Lecture Notes 

in Math. 682, Springer- Verlag, Berlin, 1978. 
[J] G.A. Jones, Varieties and simple groups, J. Austr. Math. Soc. 17 (1974), 163- 

173. 

[L] M. Larsen, Word maps have large image, Israel J. Math. 139 (2004 ), 149-156. 

[L2] M. Larsen, How often is a partition an n'th power?, arXiv: math. CO/9712223 

[LP] M. Larsen and R. Pink, Finite subgroups of algebraic groups. Preprint, 1999. 

[LL] R. Lawther and M.W. Liebeck, On the diameter of a Cayley graph of a simple 

group of Lie type based on a conjugacy class, J. Comb. Theory, Ser. A 83 

(1998), 118-137. 

[LiSh] M.W. Liebeck and A. Shalev, Diameter of simple groups: sharp bounds and 

applications, Annals of Math. 154 (2001), 383-406. 
[LiSh2] M.W. Liebeck and A. Shalev, Fuchsian groups, coverings of Riemann surfaces, 

subgroup growth, random quotients and random walks, J. Algebra 276 (2004), 

552-601. 

[Lish3] M.W. Liebeck and A. Shalev, Fuchsian groups, finite simple groups, and rep- 
resentation varieties, Invent. Math. 159 (2005), 317-367. 
[MZ] C. Martinez and E.I. Zelmanov, Products of powers in finite simple groups, 

Israel J. Math. 96 (1996), 469-479. 
[Ma] H. Matsumura, Commutative Algebra, W. A. Benjamin Co., New York, 1970. 

[MS] T.W. Miiller and J-C. Schlage-Puchta, Character Theory of Symmetric 

Groups, Subgroup Growth of Fuchsian Groups, and Random Walks, preprint, 
arXiv: |math.GR/0305260| 



[Na] M.B. Nathanson, Additive Number Theory: the classical bases, Graduate Texts 

in Mathematics 164, Springer, 1996. 
[NS1] N. Nikolov and D. Segal, On finitely generated profinite groups, I: strong 

completeness and uniform bounds, to appear in Annals of Math. 
[NS2] N. Nikolov and D. Segal, On finitely generated profinite groups, II: product 

decompositions of quasisimple groups, to appear in Annals of Math. 
[Pi] R. Pink, Compact subgroups of linear algebraic groups J. Algebra 206 (1998), 

no. 2, 438-504. 

[SW] J. Saxl and J.S. Wilson, A note on powers in simple groups, Math. Proc. Camb. 

Phil. Soc. 122 (1997), 91-94. 
[Sh] A. Shalev, Word maps, conjugacy classes, and a non-commutative Waring- 

type theorem, to appear in Annals of Math. 
[Ti] J. Tits, Classification of algebraic semisimple groups, in Algebaic Groups and 

Discontinuous Subgroups (Proc. Sympos. Pure Math., Boulder, Colo., 1965), 

Amer. Math. Soc, Providence, R.I., 1966, pp. 33-62. 
[W] J.S. Wilson, First-order group theory, in Infinite Groups 1994, de Gruyter, 

Berlin, 1996, pp. 301-314. 
E-mail address: larsen@math.indiana.edu 

Department of Mathematics, Indiana University, Bloomington, IN 47405, 
U.S.A. 

E-mail address: shalev@math.huji.ac.il 



Einstein Institute of Mathematics, Hebrew University, Givat Ram, Jerusalem 
91904, Israel 



